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ABSTRACT 

CQA services are collaborative platforms where users ask 
and answer questions. We investigate the influence of na¬ 
tional culture on people’s online questioning and answering 
behavior. For this, we analyzed a sample of 200 thousand 
users in Yahoo Answers from 67 countries. We measure em¬ 
pirically a set of cultural metrics defined in Geert Hofstede’s 
cultural dimensions and Robert Levine’s Pace of Life and 
show that behavioral cultural differences exist in community 
question answering platforms. We find that national cul¬ 
tures differ in Yahoo Answers along a number of dimensions 
such as temporal predictability of activities, contribution- 
related behavioral patterns, privacy concerns, and power in¬ 
equality. 

Categories and Subject Descriptors 

K.4.0 [Computers and Society]: General; J.4 [Social 
and Behavioural Sciences]: Sociology 
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1. INTRODUCTION 

Cultural differences exist in almost all aspects of social 
interactions. For example, in some cultures in Asia it may be 
considered disrespectful for people to express their opinions 
or ask questions to authority figures (e.g., teachers, elders). 
In other cultures (such as USA or Canada) asking questions 
is expected or even encouraged. 

Cross-country cultural variations have been studied in the 
real world via small-scale experiments and opinion surveys. 
Geert Hofstede 13 administered opinion surveys to a large 


number of IBM employees from different countries in the 
1960s and 1970s. He discovered five cultural dimensions 
(individualism, power distance, uncertainty avoidance, mas¬ 
culinity, and long term orientation), that can be attributed 
to the existence of cultural variations. Three of these di¬ 
mensions, individualism, power distance, and uncertainty 
avoidance, have been used to assess cultural differences in 
online contexts such as Twitter communication [^, emoticon 
usage 27 and online scheduling [^. In brief, individual¬ 


ism reflects the extent to which an individual is integrated 


into a group (e.g., individualistic cultures like USA empha¬ 
sizes mostly on their individual goals, as opposed to col¬ 
lectivist cultures like China that emphasizes on group har¬ 
mony and loyalty). Power distance is the extent to which the 
less powerful members of an organization or society expect 
and accept that power is distributed unequally (e.g., in high 
power distance countries subordinates simply comply with 
their leaders). Uncertainty avoidance defines the extent to 
which society members feel uncomfortable with uncertainty 
and ambiguity (e.g., the stereotypical Swiss plans everything 
ahead supposedly to avoid uncertainty). 

Psychologist Robert V. Levine proposed the Pace of 
Life metric based on the walking speed of city people over 
a distance of 60 feet, the service time for standard requests 
for stamps, and the clock accuracy of city banks. During 
the 1990s, Levine employed 19 experimenters in large cities 
from 31 countries and computed country-specific Pace of Life 
ranks. He found significant differences in Pace of Life across 
cultures and ranked the cultures based on that. 

Such cross-cultural variations that sociologists and psy¬ 
chologists already found in the offline world lead to our 
main research question: Does national culture determine 
how we participate in online Community Question Answer¬ 
ing (CQA) platforms? CQAs such as Yahoo Answers {YA), 
Quora and Stack Overflow have been popular in the last 
decade. These platforms are rich and mature repositories 
of user-contributed questions and answers. For example, 
YA, launched in December 2005, has more than one billion 
posted answersj^and Quora, one of the fastest growing CQA 
sites has seen three times growth in 2013[^ 

National cross-cultural variations have been studied in a 
number of online contexts, including social networks (e.g., 
Twitter [^, Facebook [29] ), location search and discov¬ 
ery (e.g.. Foursquare 34 ) and online scheduling (e.g.. Doo¬ 
dle [30|) . While CQA platforms have been intensively stud¬ 
ied [32[ |m |28[ |16| , to the best of our knowledge, there has 
been no study focusing on users’ cultural differences and how 
they shape asking, answering, or reporting abuses in CQA 
platforms. If cultural variations exist in CQA platforms, 
they could be used for more informed system design, includ- 


^ http: //www.yanswersbloguk.com/b4/2010/05/04/1- 
billion-answers-served / 

^http://www.goo.gl/MfK83y 






ing question recommendation, follow recommendation, and 
targeted ads. 

To fill this gap, we analyzed about 200 thousand sam¬ 
pled YA users from 67 countries who were active between 
2012 and 2013. We tested a number of hypotheses associ¬ 
ated with Hofstede’s cultural dimensions and Levine’s Pace 
of Life. Our results show that YA is not a homogeneous sub¬ 
cultural community: considerable behavioral differences ex¬ 
ist between the users from different countries. We find that 
users from individualistic countries provide more answers, 
have higher contribution than take away, and are more con¬ 
cerned about their privacy than those from collective cul¬ 
tures. Lfsers from individualistic countries are also less likely 
to provide an answer that violates community norms. We 
also find that higher power distance countries show more 
indegree imbalance in following relationships compared to 
lower power distance countries. Finally, we find that users 
from higher Pace of Life and lower uncertainty avoidance 
countries have more temporally predictable activities. 

The rest of the paper is structured as follows. Section 
discusses previous analysis of CQA platforms and the exist¬ 
ing body of work on online cultural variations. Section 
presents the YA functionalities relevant to this study and 
the dataset used. We introduce the hypotheses and present 
the results relating to Levine’s Pace of Life and Hofstede’s 
cultural dimensions in YA in Sectionj^and Sectionj^ respec¬ 
tively. We discuss the impact of these results in Section]^ 

2. RELATED WORK 

Colder and Macy studied collective mood in Twit¬ 
ter across countries from 509 million Twitter posts by 2.4 
million users over a 2-year period. Despite having differ¬ 
ent cultures, geographies, and religions, all countries (USA, 
Canada, UK, Australia, India, and English-speaking Africa) 
in their study showed similar mood rhythms—people tended 
to be more positive on weekends and early in the morning. 
Park et al. examined the variation of Twitter users’ 
emoticon usage patterns in cross cultures. They used Hofst¬ 
ede’s national culture scores of 78 countries and found that 
collectivist cultures favor vertical and eye-oriented emoti¬ 
cons, where people within individualistic cultures favor hor¬ 
izontal and mouth-oriented emoticons. Hofstede’s cultural 
dimensions have also been used to study whether culture of 
a country is associated with the way people use Twitter . 
In another study on cross-country Twitter communication, 
Carcia et al. showed that cultural variables such as Hofst¬ 
ede’s indices, language and intolerance have an impact on 
Twitter communication volume [^. 

Silva et al. used food and drink check-ins in Foursquare 
to identify cultural boundaries and similarities across pop¬ 
ulations. They showed that online footprints of foods and 
drinks are good indicators of cultural similarities between 
users, e.g., lunch time is the perfect time for Brazilians to 
go for slow food places more often, whereas Americans and 
English people go for slow foods more at dinner time. Ex¬ 
tracted features like these allow them to apply simple clus¬ 
tering algorithms such as K-means to draw cultural bound¬ 
aries across the countries. 

Quercia [29| used Satisfaction With Life tests and mea¬ 
sured happiness of 32,787 Facebook users from 12 coun¬ 
tries (Australia, Canada, France, Germany, Ireland, Italy, 
New Zealand, Norway, Singapore, Sweden, UK, USA ). He 
found that despite comparative economic status, country- 


level happiness significantly varies across the countries and 
that it strongly correlates with official well-being scores. 

Reinecke et al. used about 1.5 million Doodle polls 
from 211 countries and territories and studied the influence 
of national culture on people’s scheduling behavior. Using 
Hofstede’s cultural dimensions, they found that Doodle poll 
participants from collectivist countries find more consensus 
than those from predominantly individualist societies. 

CQA platforms have also attracted much research inter¬ 
est focusing on content, user behavior and applications. Re¬ 
search on CQA content has analyzed textual aspects of ques¬ 
tions and answers. Researchers have proposed algorithmic 
solutions to automatically determine the quality of ques- 
and answers [^. Research on CQA user behav¬ 
ior has been mostly about understanding why users con¬ 
tribute content: that is, why users ask questions (askers 
are failed searchers, in that, they use CQA sites when web 
search fails [^); and why they don’t answer questions (e.g., 
they refrain from answering sensitive questions to avoid be¬ 
ing reported for abuse and potentially lose access to the 
community i)- As for applications, researchers have pro¬ 
posed effective ways of recommending questions to the most 
appropriate answerers [^, automatically answering ques¬ 
tions based on past answers [33| , and retrieving factual an¬ 
swers or factual bits within an answer [^. Our previous 
work used user-provided rule violation reports and user 
social network features to detect the content abusers in YA. 

However, there has been no empirical cross-cultural anal¬ 
ysis of CQA platforms. This paper is a first step in this 
direction and it verifies whether cultural differences are man¬ 
ifested in one such platform, YA. 


tions 23 


3. YAHOO ANSWERS 

After 9 years of activity, YA has 56M monthly visitors 
(U.S. only)[^ The functionalities of the YA platform and 
the dataset used in this analysis are presented next. 

3.1 The Platform 

YA is a CQA platform in which community members ask 
and answer questions on various topics. Users ask questions 
and assign them to categories selected from a predefined 
taxonomy, e.g.. Business & Finance, Health, and Politics & 
Government. YA has about 1300 categories. Users can find 
questions by searching or browsing through this hierarchy 
of categories. A question has a title (typically, a short sum¬ 
mary of the question), and a body with additional details. 

A user can answer any question but can post only one 
answer per question. Questions remain open for four days 
for others to answer. However, the asker can select a best 
answer before the end of this 4-day period, which automat¬ 
ically resolves the question and archives it as a reference 
question. The best answer can also be rated between one to 
five, known as answer rating. If the asker does not choose 
a best answer, the community selects one through voting. 
The asker can extend the answering duration for an extra 
four days. The questions left unanswered after the allowed 
duration are deleted from the site. In addition to questions 
and answers, users can contribute comments to questions 
already answered and archived. 


^http: //www.listofsearchengines.org/qa-search-engines 





VA has a system of points and levels to encourage and 
reward participation|^ A user is penalized five points for 
posting a question, but if she chooses a best answer for her 
question, three points are given back. A user who posts an 
answer receives two points; a best answer is worth 10 points. 

A leaderboard, updated daily, ranks users based on the to¬ 
tal number of points they collected. Users are split into seven 
levels based on their acquired points (e.g., 1-249 points: level 
1, 250-999 points: level 2, ..., 25000-1- points: level 7). These 
levels are used to limit user actions, such as posting ques¬ 
tions, answers, comments, follows, and votes: e.g., first level 
users can ask 5 questions and provide 20 answers in a day. 

YA requires its users to follow the Community Guidelines 
that forbid users to post spam, insults, or rants, and the 
Yahoo Terms of Service that limit harm to minors, harass¬ 
ment, privacy invasion, impersonation and misrepresenta¬ 
tion, fraud and phishing. Users can flag content (questions, 
answers or comments) that violates the Community Guide¬ 
lines and Terms of Service using the “Report Abuse” func¬ 
tionality. Users click on a flag sign embedded with the con¬ 
tent and choose a reason between violation of the commu¬ 
nity guidelines and violation of the terms of service. Re¬ 
ported content is then verified by human inspectors before 
it is deleted from the platform. 

Users in VA can choose to follow other users, thus cre¬ 
ating a follower-followee relationship used for information 
dissemination. The followee’s actions (e.g., questions, an¬ 
swers, ratings, votes, best answer, awards) are automati¬ 
cally posted on the follower’s newsfeed. In addition, users 
can follow questions, in which case all responses are sent to 
the followers of that question. 

3.2 Dataset 

We studied a random sample of about 200k users from 
VA who were active between 2012 and 2013. These users 
posted about 9 million questions (about 45 questions/user), 
43 million answers (about 215 answers/user), and 4.5 million 
abuse reports (about 23 reports/user). They are connected 
via 490k follower-followee relationships in a social network. 
The indegree and outdegree distributions of the social net¬ 
work follow power-law distributions, with an exponential ht- 
ting parameter a of 1.83 and 1.85, respectively. 

In our dataset, we have users from 67 countries. Figure[^ 
shows the number of users in our dataset as a function of the 
number of Internet users taken from the World Banklf] We 
find a linear relationship between the number of users per 
country in our VA dataset and the number of Internet users 
in the World Bank dataset for each country. It means that 
the VA users from our dataset are not skewed by country. 
Instead, they represent a sample of global Internet users. 

To investigate how sensitive this correlation is to the num¬ 
ber of users per country, we computed the Pearson correla¬ 
tion between the number of VA users in x countries and their 
respective internet population. The x countries were ranked 
based on the number of VA users found in the dataset, and 
X was varied from top 20 to all 67 countries. Figurej^shows 
that there are several peaks in the correlation, but the values 
are high and between 0.5 and 0.7. We select as a threshold 
the second highest correlation peak and thus included in the 
study 41 countries which have at least 150 users per country. 


^https: / /answers .yahoo, com / info / scoring_system 
® http://www.data.worldbank.org/indicator/IT.NET.USER.P2 



Figure 1: The number of Internet users and YA 
users for 67 countries. The regression line and 
95% confidence interval area are also shown. The 
countries are represented by a 2-letter country code 
based on ISO 3166. 



Figure 2: Number of top countries based on the 
number of YA users and correlation with their num¬ 
ber of Internet users. All correlations are statisti¬ 
cally significant with p-value<0.05. 

4. LEVINE’S PACE OF LIFE 

In this section, we analyze Levine’s Pace of Life cultural 
dimension in the context of VA and show how it relates to 
user activities such as questioning, answering and reporting. 
In his book [^, psychologist Robert Levine defines Pace of 
Life as “the flow or movement of time that people experi¬ 
ence”. With the help of 19 experimenters, he collected and 
compared three indicators of the Pace of Life in 36 large 
cities from 31 countries around the world during a warm 
summer month between 1992 and 1995 [^. The indicators 
are: 

• Walking speed: They measured walking speed of 
35 men and 35 women over a distance of 60 feet in 
main downtown areas in each city. Measurements were 
done during prime business hours after controlling a 
number of variables such as sidewalks, crowd, effects 
of socialization. They found significant differences in 
pedestrians walking speed—for example, pedestrians 
in Rio de Janeiro, Brazil walked only two-thirds as 
fast as pedestrians in Zurich, Switzerland. 






• Postal speed: In each city, they measured the time 
it took postal workers to serve a standard request for 
stamps and considered this time as a proxy for work 
speed. They handed each clerk money and a note 
written in the local language requesting a common 
stamp. For example, in the United States, the clerk 
was handed a 5 dollar bill with a request for one 32- 
cent stamp. They found that overall Western Europe 
was the fastest to serve a standard request. 


each country has on average 134k questions, 642k answers 
and 67k abuse reports. Normalized entropy ranges from 0 
to 1. A normalized question entropy close to 0 indicates 
that most of the questions the user asked are within one 
time interval of the day, whereas the closer to 1, the more 
likely is that the user asked questions during all intervals. 
Finally, the question/answer/report entropy for a country 
c, Entropyq/a/r,c, is defined as the geometric mean of all 
Entropyg/a/r,u computed for the users of that country: 


• Clock accuracy: To quantitatively measure time con¬ 
cerns, the researchers checked the clock accuracy of 
randomly selected 15 downtown banks in each city. 
The reported times were then compared to those re¬ 
ported by the telephone company, which was consid¬ 
ered accurate. 

Levine combined these three scores into a country-specific 
Pace of Life score and concluded that “the Pace of Life was 
fastest in Japan and the countries of Western Europe and 
was slowest in economically undeveloped countries. The 
pace was significantly faster in colder climates, economically 
productive countries, and in individualistic cultures” [21| . 

Intuitively, to cope with the rigid perception of time, peo¬ 
ple from the higher Pace of Life countries have to be planned 
and organized in their daily activities. On the other hand, 
people from lower Pace of Life countries might allow some 
unstructured activities, as in those countries the expectation 
of following the ‘right’ time is more relaxed. 

Applying these findings to online communities such as 
CQA platforms, we expect that people from higher Pace 
of Life countries, such as the USA, will be less likely to ask 
or answer questions during busy hours of the day, e.g., office 
hours. From these ideas, we hypothesize the following in YA-. 

[HI] Users from countries with a higher Pace of Life 
score show more temporally predictable activities. 

To test this hypothesis, we calculate how probable a coun¬ 
try’s users are in asking, answering and reporting at different 
times of day and correlate that with that country’s Pace of 
Life rank. For example, if a user only asks or answers ques¬ 
tions in the evening, he is temporally more predictable than 
a user who asks or answers in the morning, afternoon and 
night. In a Twitter study [^, Colder and Macy also find 
diurnal mood rhythms in different cultures. 

In order to calculate temporal predictability, we only con¬ 
sider working days, as weekends are less predictable. More 
specifically, similar to [^, we divide the working day in 
five time intervals: morning (6:00 - 8:59), office time (9:00- 
17:59), evening (18:00-20:59), late night (21:00- 23:59), sleep¬ 
ing time (00:00 - 05:59). All the reported times are users’ 
local time. We use information entropy [^, a measure of 
disorder, to calculate the temporal predictability. 

For a given activity (asking, answering, or reporting) and 
C intervals, we can compute p{c), the probability of an ac¬ 
tivity belonging to interval c. We measure the normalized 
entropy for user u for all activities as: 


Entropyu 


-Ec6c7’(c)^°S'(p(c)) 

\logC\ 


( 1 ) 


We calculate users’ normalized entropies for all their ques¬ 
tions, answers and abuse reports and refer to them as ques¬ 
tion, answer and report entropy, respectively. In our dataset. 


Entropyg/a/r,c = ( 11 ^ ( 2 ) 

U^Uc 

where Uc is the set of users in country c. We use geometric 
mean to account for the skewed distribution of the entropy 
scores, something that the regular arithmetic mean cannot 
handle. 

Table shows Pearson correlations between question, an¬ 
swer, report entropy and Pace of Life ranks, where lower 
ranks mean higher Pace of Life. For both questions and 
answers, the overall Pace of Life ranks have positive corre¬ 
lations with question and answer entropy with r = 0.67 and 
r = 0.37, respectively. These positive relationships are seen 
in the Figuresj^and We find positive correlations between 
walking speed rank, post office service time rank, and clock 
accuracy time rank with question entropy with r = 0.48, 
r — 0.60 and r = 0.48, respectively. For answers, we find 
positive correlations between post office service time rank, 
and clock accuracy time rank with entropy with r = 0.38 
and r = 0.29, respectively. However, we do not find any 
statistically significant relationships between report entropy 
and Pace of Life ranks. 

These results confirm that users from countries with a 
higher Pace of Life score show more temporally predictable 
asking and answering behavior in YA. 



Entropy 

Pace of Life 

Question 

Answer 

Report 

Overall 

0.67*** 

0.37* 

0.18 

Walking speed 

0.48** 

0.18 

0.06 

Post office 

0.60** 

0.38* 

0.19 

Clock accuracy 

0.48** 

0.29* 

0.21 


Table 1: Pearson correlations between question, an¬ 
swer, report entropy and Pace of Life rank. Lower 
ranks mean higher Pace of Life, p-values are indi¬ 
cated as: p<0.005(***), p<0.05 (**), p<0.1 (*). 


5. HOFSTEDE’S CULTURAL DIMENSIONS 

In this section, we analyze a number of cultural dimen¬ 
sions in YA proposed by Geert Hofstede. We show how three 
cultural dimensions defined by Hofstede—individualism, power 
distance and uncertainty avoidance are manifested in the 
ecosystem of YA. 

Hofstede’s cultural dimensions theory is a framework for 
analyzing cultural variability. In his original model [12| , 
Hofstede proposed four primary dimensions by surveying 
in the 1960s and 1970s a large number of IBM employ¬ 
ees from 40 countries: power distance (PDI), individual¬ 
ism (IDV), uncertainty avoidance (UAI) and masculinity 
(MAS). Later [o, he added two more dimensions: long¬ 
term orientation (LTO) and indulgence versus restraint (IVR). 













Overall rank of Pace of Life 

Figure 3: Pace of Life overall rank vs. average ques¬ 
tion entropy per country. Countries shown are the 
ones in our dataset for which a Pace of Life rank has 
been published. Countries are ranked in decreasing 
order of their Pace of Life value. A regression line 
is also shown. 
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Overall rank of Pace of Life 

Figure 4: Pace of Life overall rank vs. average an¬ 
swer entropy per country. Countries shown are the 
ones in our dataset for which a Pace of Life rank has 
been published. Countries are ranked in decreasing 
order of their Pace of Life value. A regression line 
is also shown. 


Three of the dimensions, individualism, power distance, and 
uncertainty avoidance, have been used in a number of recent 
studies of online behavior |27[ . We also use these three 

Hofstede’s cultural dimensions and relate them to a number 
of hypotheses in the context of VA. 

5.1 Individualism (IDV) 

Individualism is the extent to which an individual is inte¬ 
grated into a group. In individualistic societies (high IDV) 
such as the USA and England, personal achievements and 
individual rights are emphasized; an individual is expected 
to take care of only himself and his immediate family. In col¬ 
lectivist countries such as those of India, China, and Japan, 
individuals are expected to place the family and group goals 
above those of self. In this work, we investigate how individ¬ 
ualism is related to users’ contribution, (un)ethical behavior 
and privacy settings in VA. 


Individualism and contribution. The usage of the 
Internet takes time from a number of daily activities in¬ 
cluding face-to-face socialization. In collectivist countries, 
people are expected to give a fair amount of time on socia¬ 
bility, hence traditionally they seem to spend less time on 
the Internet compared to the people from the individualistic 
cultures . In VA we expect that users from individualistic 
countries spend more time online, hence they can provide 
more answers and eventually they can contribute more to 
the community than their direct benefits from the commu¬ 
nity. We hypothesize the following: 

[H2] Users from countries with higher individualism 
index provide more answers. 

[H3] Users from countries with higher individualism 
index contribute more to the community than what 
they take away from the community. 

We correlate the geometric mean of the number of answers 
posted by the users from a country with that country’s indi¬ 
vidualism index (a higher score means higher individuality). 
We use geometric mean as an average because of the skewed 
distributions of the number of answers. In the calculation of 
the geometric mean, we exclude the users who have not pro¬ 
vided any answers. We observe a positive correlation, shown 
in Figure with r = 0.46, p < 0.005. This means that, on 
average, users from individualistic countries provide more 
answers. 
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Figure 5: Individualism index vs. the average num¬ 
ber of answers posted by users per country. A re¬ 
gression line is also shown. 

To quantify users’ contribution compared to their take 
away, we compute yielding scores of the users. The yielding 
score of a user is simply a difference between his contribu¬ 
tion and his take away. For yielding scores, we consider 
K4’s point system, which awards two points for an answer, 
ten points for a best answer, and penalizes five points for a 
question: 


Yielding,^ = f (contribution) — f (takeaway) 

= 2.0* Au+ 10.0* BAu-5.0* Qu 

where Qu is the number of questions posted by user u, Au is 
the number of answers posted by u, and BAu is the number 
of best answers posted by u. 

Finally, a country’s yielding score Vieldinga is defined as 
the geometric mean of all Vieldingu computed for the users 





of each country c: 


Yielding^ = ^ Yielding^ (4) 

u£Ua 

where Uc is the set of users in country c and we take only 
those users having yielding scores more than zero. We cor¬ 
relate a country’s geometric mean of the yielding score with 
the country’s individualism index and we obtain a positive 
correlation (Figure]^ with r = 0.37,p < 0.05. This result 
suggests that the more individualistic a country is, the more 
its users contribute to YA than what they take away from 
the community. 


that the countries with higher collectivist scores show the 
most corruption. 

Based on this online and offline user unethical behavior 
that is influenced by culture, our intuition is that we could 
observe a similar trend in YA. In CQA platforms, the expec¬ 
tation is that users would provide helpful answers to posted 
questions. As such, users are required to follow the Com¬ 
munity Guidelines and the Yahoo Terms of Service while 
answering. When users post bad answers, community mem¬ 
bers flag them. Later, human moderators check whether 
these flags are applied correctly or not. We expect that the 
more collective a culture is, the more probable the answers 
from its users will be flagged as abusive. Formally, we hy¬ 
pothesize that: 
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Figure 6: Individualism index vs. yielding score per 
country. A regression line is also shown. 


There might be multiple explanations about why users 
from individualistic countries contribute more to the com¬ 
munity as reflected by hypotheses H2 and H3. One expla¬ 
nation is that individualistic cultures have a more favorable 
collaborative environment [^, so individuals feel the urge 
to contribute to the community. Another explanation could 
be that users from individualistic cultures simply want more 
points than collectivist cultures. As points are awarded for 
contribution (e.g., an answer earns two points) and partic¬ 
ipation (e.g., each login earns one point), users might be 
tempted to contribute more. In fact, we obtain a positive 
and significant correlation (r = 0.42, p < 0.01) between a 
country’s points (calculated as geometric mean of the coun¬ 
try’s user points) and its individualism index. Finally, there 
might be other confounding factors (e.g., internet penetra¬ 
tion) that affect the contribution of a country’s users on the 
platform. Thus, it is difficult to confirm whether the users’ 
behavioral differences on contribution are due to their cul¬ 
tural differences. 

Individualism and (un)ethical behavior. The degree 
to which a culture is collectivist or individualistic has an im¬ 
plication on its users’ online (un)ethical behavior. For ex¬ 
ample, the more individualistic (less collectivistic) a culture, 
the lower the rate of software piracy and online music 
piracy [^. Personal rights are paramount in individualis¬ 
tic cultures, where people do not feel obligated to engage 
in group cooperation that involves conspiracy. Group co¬ 
operation and conspiracy are two key elements for the real 
world unethical behaviors such as corruption [26[ . Trian- 
dis et al. used Hofstede’s individualism index and found 


[H4] Users from more collective (less individualis¬ 
tic) cultures have higher probability to violate CQA 
norms. 

To this end, for each user u, we first calculate Pu, the 
probability that his answers violate community norms (and 
thus are correctly flagged by other users): 

f( correctly flagged answers from u 
0 total answers from u 

Finally, Pc, the geometric average of all pu probabilities 
computed for each country c: 

a = (n!>.)^ (6) 

uGUc 

where Uc is the set of users in country c. 

The Pearson correlation r = —0.48,p < 0.05 shows that 
the probability of abuses in answers provided by the users 
from a particular country is negatively correlated with that 
country’s individualism index. Figure indeed shows that 
the probability decreases with an increasing individualism 
index, meaning that if an answer comes from an individual¬ 
istic country, it is less probable to violate community rules. 
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Figure 7: Individualism index vs. the probability 
that an answer from a country is correctly flagged. 
A regression line is also shown. 

Individualism and privacy concerns. Although on¬ 
line privacy concerns are global, the extent to which people 
perceive these concerns as real varies across cultures. For 
example, in the United States, privacy is a basic human 
right, endorsed by the American Bill of Rights, while Asian 
countries show little or no recognition on privacy in their 








legal systems A survey of 1261 Internet users from five 
big cities—Bangalore, Seoul, Singapore, Sydney and New 
York—shows that Internet users from individualistic cul¬ 
tures are more concerned about privacy than those in collec¬ 
tive cultures [^. We expect that a similar trend also exists 
in CQA platforms. We hypothesize that: 

[H5] Users from higher individualism index countries 
exhibit higher level of concern about their privacy. 

We use the modifications of the privacy settings on users’ 
YA accounts as a proxy of privacy concern. In YA, pri¬ 
vacy settings are typically available for users to personalize 
for content (questions or answers) and follower-followee net¬ 
work. Intuitively, privacy-concerned users would take the 
opportunity to change the default privacy settings. So, we 
consider the fraction of public privacy profiles in a coun¬ 
try to draw a conclusion on how concerned its users are 
about their privacy. However, the default privacy in YA is 
public. It might be possible that many of the users in the 
public group are dormant: users who signed up, asked and 
answered some questions, and disappeared quickly. These 
users might skew the results of our study, thus, we only con¬ 
sider active users from our dataset— users who have asked 
and answered more than 10 questions during our observation 
interval. These active users are about 79% of our dataset. 
We note that our conclusions remain the same if we con¬ 
sider more active users by filtering users who have asked 
and answered more than 20 questions. 

Based on Hofstede’s Individualism index, the Hofstede 
Centr^ has tagged countries as individualistic or collec¬ 
tivist. In our study, we use this classification. Figure 
shows the percentage of user profiles with public privacy set¬ 
tings in a country, as function of the country’s ranking in the 
collectivist and individualistic class. The figure shows that, 
on average, collectivist countries have a higher percentage 
of public profiles: collectivist countries such as Spain, Peru, 
Argentina, and Mexico have higher percentage of public pro¬ 
files than individualistic countries such as United Kingdom, 
United States, Australia or Italy. 
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Figure 8: Percentage of public privacy settings vs. 
rank of collectivist and individualistic countries, re¬ 
spectively. Country ranks are based on the percent¬ 
age of public privacy settings and they are sepa¬ 
rately done for collectivist and individualistic coun¬ 
tries. 


® http: / /geert-hofstede .com / countries. html 


5.2 Power Distance Index (PDI) 

PDI is the extent to which the less powerful members of 
an organization or society expect and accept that power is 
distributed unequally. This dimension sheds light on how 
a society handles inequalities among its members. In coun¬ 
tries with high PDI, such as countries from Latin, Asian, 
African and Arab world, everybody has a place in the social 
hierarchy and people accept the situation without question¬ 
ing it. However, in Anglo and Germanic countries, which 
are low power distance countries, people seek distribution of 
power and ask for justifications of power inequality. 

PDI essentially measures the distribution of wealth and 
power between people in a country or culture. In YA, we 
can use the indegree (number of followers) as a proxy of 
wealth and power. For example, the larger the number of 
followers users have, the larger an audience they have for di¬ 
rect communication. Higher indegree users are also found to 
be more central (thus more retained [^) across a number 
of network centrality metrics [^. Moreover, these users’ 
questions are forwarded to more users, hence more likely to 
be getting an answer. A study on YA shows that users 
receive more answers from close neighborhoods. Given the 
high number of questions that remain unanswered (42% in 
YA reported by a study [^) in GQA platforms, bringing 
answers not only shows a user’s potential capability, but 
also makes the platform mature and informative. Taking 
ideas from the unequal distribution of wealth and power in 
higher power distance countries, we expect that in YA, users 
from those countries also have Inequality in their indegrees. 
Garcia et al. have found similar indegree inequality in 
Twitter. We hypothesize the following in YA: 

[H6] Users from higher power distance countries show 
a larger indegree imbalance in follow relationships. 

We correlate countries power distance index (higher in¬ 
dex means power distance is high) with their users’ inde¬ 
gree imbalance. A user’s indegree imbalance is calculated 
as the difference between her friends’ average indegree and 
her indegree. Finally, a country’s indegree imbalance is the 
geometric mean of the indegree imbalance of its users. 

For all countries, except Panama and Philippines, we ob¬ 
tained a positive indegree imbalance, meaning that for those 
countries, on average, a user’s contacts have more contacts 
than the user. This supports a well-known hypothesis friend¬ 
ship paradox in sociology. The friendship paradox states 
that your friends have on average more friends than you 
have, however, most people think that they have more friends 
than their friends have [^. It has been shown that the para¬ 
dox holds for both Twitter and Facebook [^. Now we 
also show it for YA. 

Figure shows the relation between PDI and indegree 
imbalance (excluding Panama and Philippines). The figure 
indeed shows a positive correlation. We obtained a positive 
correlation r = 0.65, p < 0.005 between indegree imbalance 
and PDI for all countries (including Panama and Philip¬ 
pines). This supports the hypothesis that users from coun¬ 
tries with higher PDI are more comfortable with indegree 
imbalance. 

5.3 Uncertainty Avoidance Index (UAI) 

UAI is the extent to which people feel uncomfortable with 
uncertainty and ambiguity. Individuals from countries ex- 





Figure 9: Power distance index vs. indegree imbal¬ 
ance. A regression line is also shown. 

hibiting strong UAI tend to minimize uncertainty and am¬ 
biguity by careful planning, and enforcing rules and regula¬ 
tions. On the other hand, low uncertainty avoidance cultures 
maintain a more relaxed attitude in unstructured situations. 

For example, Switzerland has a reasonably high uncer¬ 
tainty avoidance index (58) compared to countries such as 
Singapore (8) and Sweden (29). In fact, an online scheduling 
behavior study on Doodle (http://doodle.com/) shows 
that Switzerland and Germany have a high advance plan¬ 
ning time of 28 days. In YA, our related hypothesis is: 

[H7] Users from countries with higher uncertainty 
avoidance index exhibit more temporally predictable 
activities. 

Figures 1 1 ![ \T ^ show the relationship between question, 
answer and abuse report entropy vs. uncertainty avoidance 
index, respectively. Note that a higher UAI means lower 
uncertainty and ambiguity. The negative relations in the 
figures indicate that users from countries with higher uncer¬ 
tainty avoidance index tend to have lower question, answer 
and abuse report entropies, thus they are more temporar¬ 
ily predictable. All the entropies have negative relation 
to uncertainty avoidance index: r = —0.43 for questions, 
r = —0.55 for answers, and r = —0.51 for abuse reports. All 
correlation values are statistically significant with p < 0.05. 



Figure 10: Question entropy vs. uncertainty avoid¬ 
ance index. Only countries having more than 300 
users are plotted. A regression line is also shown. 
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Figure 11: Answer entropy vs. uncertainty avoid¬ 
ance index. Only countries having more than 300 
users are plotted. A regression line is also shown. 
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Figure 12: Report entropy vs. uncertainty avoid¬ 
ance index. Only countries having more than 300 
users are plotted. A regression line is also shown. 

6. SUMMARY AND DISCUSSION 

Observing the global spread of information and commu¬ 
nication technologies, researchers sometimes predicted that 
the online world would be converging into a “one-world cul¬ 
ture” . With the advent of the large-scale online behav¬ 
ioral datasets in the past decade from online platforms like 
Twitter, Facebook and Foursquare, researchers showed that 
the Internet does not have a homogeneous culture. Instead, 
country-specific cultural variations do exist. We showed 
the same non-homogeneity, but in a very different online 
context—community question answering. 

In this work, we analyzed about 200 thousand sampled 
Yahoo Answers users from 67 countries. We studied users’ 
behavioral patterns such as temporal predictability of activ¬ 
ities, engagement, (un)ethical behavior, privacy concerns, 
and power inequality and how they compare with a number 
of cultural dimensions (Pace of Life, Individualism, Uncer¬ 
tainty Avoidance and Power Distance). We find that behav¬ 
ioral differences exist across cultures in YA. Table [2] shows 
a summary of all the hypotheses involving cultural indices 
and the results found. 

We acknowledge that our study is observational and lacks 
controlled experimental ground truth data. Therefore, we 
cannot draw causal conclusions whether cultures shape the 
ecosystem of YA. However, our results hint at the impor¬ 
tance of culture-aware CQA moderation. Note that CQA 








Pace of Life 

Users from countries with a higher Pace of Life score show more temporally predictable activities (asking, answering and 
reporting) 

Correlation 

r, = 0.67*** 
= 0.37** 

Tr = 0.18 

Individualism 

Users from higher individualism index countries provide more answers 

Users from countries with higher individualism index contribute more to the community than what they take away from 
the community 

Users from more collective (less individualistic) cultures have higher probability to violate CQA norms 

Users from higher individualism index countries exhibit higher level of concern about their privacy 

Correlation 

r = 0.46*** 
r = 0.37** 

r = -0.48** 

NA 

Power distance 

Users from higher power distance countries show larger indegree imbalance in follow relationships 

Correlation 

r - 0.65*** 

Uncertainty Avoidance 

Users from countries with higher uncertainty avoidance index exhibit more temporally predictable activities (asking, an¬ 
swering and reporting) 

Correlation 

r, = -0.43** 
ra = -0.55** 
Tr = -0.51** 


Table 2: Pearson correlation coefficients in hypotheses related to pace of life, individualism, uncertainty 
avoidance and power distance, p-values are indicated as: p<0.005(***), p<0.05 (**), p<0.1 (*). 


platforms like YA employ human moderators to evaluate 
reported abuses and determine the appropriate responses, 
from removing content to suspending user accounts. We 
find that collective cultures are more probable to provide 
bad answers. At a minimum, more attention of moderators 
are expected in these cultures to keep the environment clean. 

We find that individualistic cultures are more engaged in 
YA, e.g., by providing more answers and contributing more 
than their take away. These results confirm the generaliza¬ 
tion that individualistic cultures are highly attracted to the 
Internet. Researchers often attribute the egalitarian, demo¬ 
cratic nature of the Internet to this engagement 

The evidence of different engagement patterns and differ¬ 
ence in pace of life across cultures in CQA platforms imply 
that some core functionalities such as question recommenda¬ 
tion and follow recommendation could benefit from exploit¬ 
ing cultural factors. In question recommendation, questions 
are routed to the most appropriate answerers. To find out 
such answerers, factors such as followers, interests, question 
diversity and freshness 36 are considered. Our study sug¬ 
gests that including cultural variables such as individualism 
can be useful. For example, as users from collective cul¬ 
tures are less probable to answer, questions from those com¬ 
munities should be routed to a larger number of potential 
answerers. 

Another variable. Pace of Life, could also be a factor in 
question recommendation. Our results show that users from 
countries with a higher pace of life are temporally more pre¬ 
dictable. In those cultures, if questions are forwarded to 
answerers during the busy hours of the day (e.g., during of¬ 
fice hours), the questions are less likely to get an answer. 
Solutions could include routing questions to a larger num¬ 
ber of potential answerers, diversifying the set of answerers 
to include users from countries with a lower Pace of Life, or 
delaying routing for after work hours. 

In the follow recommendation, CQA platforms recommend 
which other users one can follow based on shared interests, 
common contacts, and other related factors. We find that in 
YA, lower power distance countries show less indegree imbal¬ 
ance in follow relationships. For follow recommendation in 
those countries, users to be followed may be recommended 
to a user with the same level of indegree as them. 

CQA platforms could also exploit cultural differentiations 
to improve targeted ads. Okazaki and Alonso 25 analyzed 


online advertising appeals such as “soft sell” appeal (that 
works by creating emotions and atmosphere via visuals and 
symbols) and “hard sell” appeal (that provides focus product 
features, explicit information, and competitive persuasion) 
across a number of cultures. They found individualistic cul¬ 
tures like the USA are more attracted to “hard sell” appeal, 
where collective cultures like Japan are attracted to “soft 
sell” appeal. Ju-Pak’s study also confirms that fact- 
based appeal is dominant in the USA, but text-limited, vi¬ 
sual layouts are popular in collective cultures like South Ko¬ 
rea. Linguistic aspects in the ads might also be important. 
For example, focusing on ‘P, ‘me’ in individualistic cultures 
and ‘us’ and ‘we’ in collective cultures. Finally, CQA sites 
could leverage cultural variations in their platforms by, for 
example, placing textual, informative feature ads to users 
from individualistic cultures and visual and symbolic ads to 
users from collective cultures. 


7. ACKNOWLEDGMENTS 

The work was funded by the National Science Foundation 
under the grant CNS 0952420, and by the Yahoo’s Faculty 
Research and Engagement Program. 

8. REFERENCES 

[1] J. Bian, Y. Liu, E. Agichtein, and H. Zha. Finding the 
right facts in the crowd: Factoid question answering 
over social media. In Proceedings of the 17th 
International Conference on World Wide Web, pages 
467-476, 2008. 

[2] H. Cho, M. Rivera-Sanchez, and S. S. Lim. A 
multinational study on online privacy: global concerns 
and local responses. New media & society, 
11(3):395-416, 2009. 

[3] T. M. Cover and J. A. Thomas. Elements of 
information theory 2nd edition. Willey-Interscience: 
NJ, 2006. 

[4] M. De Mooij. The future is predictable for 
international marketers: converging incomes lead to 
diverging consumer behaviour. International 
Marketing Review, 17(2): 103-113, 2000. 

[5] M. De Mooij. Global marketing and advertising: 
Understanding cultural paradoxes. Sage Publications, 
2013. 












[6] D. Dearman and K. N. Truong. Why users of yahoo!: 
Answers do not answer questions. In Proceedings of 
the SIGCHI Conference on Human Factors in 
Computing Systems, pages 329-332, 2010. 

[7] S. L. Feld. Why your friends have more friends than 
you do. American Journal of Sociology, pages 
1464-1477, 1991. 

[8] R. Garcia-Gavilanes, Y. Mejova, and D. Quercia. 
Twitter ain’t without frontiers: Economic, social, and 
cultural boundaries in international communication. 

In Proceedings of the 17th ACM conferenee on 
Computer supported cooperative work & social 
computing, pages 1511-1522, 2014. 

[9] R. Garcia-Gavilanes, D. Quercia, and A. Jaimes. 
Cultural dimensions in twitter: Time, individualism 
and power. Proceedings of the 7th International AAAI 
Conference on Weblogs and Social Media, 13, 2013. 

[10] S. A. Colder and M. W. Macy. Diurnal and seasonal 
mood vary with work, sleep, and day length across 
diverse cultures. Science, 333(6051):1878-1881, 2011. 

[11] N. O. Hodas, F. Kooti, and K. Lerman. Friendship 
paradox redux: Your friends are more interesting than 
you. In Proceedings of the 7th International AAAI 
Conference on Weblogs and Social Media, 2013. 

[12] G. Hofstede. National cultures in four dimensions: A 
research-based theory of cultural differences among 
nations. International Studies of Management & 
Organization, pages 46-74, 1983. 

[13] G. Hoftede, G. J. Hofstede, and M. Minkov. Cultures 
and organizations: software of the mind: intercultural 
cooperation and its importance for survival. 
McGraw-Hill, 2010. 

[14] B. W. Husted. The impact of national culture on 
software piracy. Journal of Business Ethics, 
26(3):197-211, 2000. 

[15] K.-H. Ju-Pak. Content dimensions of web advertising: 
a cross-national comparison. International Journal of 
Advertising, 18(2):207-231, 1999. 

[16] I. Kayes, N. Kourtellis, D. Quercia, A. lamnitchi, and 
F. Bonchi. The social world of content abusers in 
community question answering. In Proceedings of the 
24 th International World Wide Web Conference, 
pages 570-580, 2015. 

[17] I. Kayes, X. Qian, J. Skvoretz, and A. lamnitchi. How 
influential are you: Detecting influential bloggers in a 
blogging community. In Proceedings of the fth 
international conference on Soeial Informatics, pages 
29-42, 2012. 

[18] I. Kayes, X. Zuo, D. Wang, and C. Jacob. To blog or 
not to blog: Characterizing and predicting retention in 
community blogs. In Proceedings of the 7th ACM/ASE 
International Conference on Social Computing 

(SocialCom’14), 2014. 

[19] E.-J. Ki, B.-H. Chang, and H. Khang. Exploring 
influential factors on music piracy across countries. 
Journal of Communication, 56(2):406-426, 2006. 

[20] R. Levine. A Geography of Time: The Temporal 
Misadventures of a Social Psychologist or How Every 
Culture Keeps Time Just a Little Bit Differently. 
University Press, 2006. 


[21] R. V. Levine and A. Norenzayan. The pace of life in 
31 countries. Journal of cross-cultural psychology, 
30(2): 178-205, 1999. 

[22] T. Levitt. The globalization of markets. International 
Business: Strategic management of multinationals, 
3:18, 2002. 

[23] B. Li, T. Jin, M. R. Lyu, 1. King, and B. Mak. 
Analyzing and predicting question quality in 
community question answering services. In Proceedings 
of the 21st International Conference Companion on 
World Wide Web, pages 775-782, 2012. 

[24] Q. Liu, E. Agichtein, G. Dror, Y. Maarek, and 

1. Szpektor. When web search fails, searchers become 
askers: Understanding the transition. In Proceedings 
of the 35th International ACM SIGIR Conference on 
Research and Development in Information Retrieval, 
pages 801-810, 2012. 

[25] S. Okazaki and J. Alonso. Right messages for the right 
site: on-line creative strategies by Japanese 
multinational corporations. Journal of Marketing 
Communications, 9(4):221-239, 2003. 

[26] H. Park. Determinants of corruption: A cross-national 
analysis. Multinational Business Review, ll(2):29-48, 
2003. 

[27] J. Park, Y. M. Baek, and M. Cha. Cross-cultural 
comparison of nonverbal cues in emoticons on twitter: 
Evidence from big data analysis. Journal of 
Communication, 64(2):333-354, 2014. 

[28] M. Qu, G. Qiu, X. He, C. Zhang, H. Wu, J. Bn, and 
C. Chen. Probabilistic question recommendation for 
question answering communities. In Proceedings of the 
18th International Conference on World Wide Web, 
pages 1229-1230, 2009. 

[29] D. Quercia. Don’t worry, be happy: The geography of 
happiness on facebook. In Proceedings of the 5th 
Annual ACM Web Science Conference, pages 316-325, 
2013. 

[30] K. Reinecke, M. K. Nguyen, A. Bernstein, M. Naf, 
and K. Z. Gajos. Doodle around the world: Online 
scheduling behavior reflects cultural differences in 
time perception and group decision-making. In 
Proceedings of the conference on Computer supported 
cooperative work, pages 45-54, 2013. 

[31] M. Richardson and R. W. White. Supporting 
synchronous social q&a throughout the question 
lifecycle. In Proeeedings of the 20th international 
conference on World wide web, pages 755-764, 2011. 

[32] C. Shah and J. Pomerantz. Evaluating and predicting 
answer quality in community qa. In Proceedings of the 
33rd International ACM SIGIR Conference on 
Research and Development in Information Retrieval, 
pages 411-418, 2010. 

[33] A. Shtok, G. Dror, Y. Maarek, and 1. Szpektor. 
Learning from the past: Answering new questions with 
past answers. In Proceedings of the 21st International 
Conference on World Wide Web, pages 759-768, 2012. 

[34] T. H. Silva, P. O. Vaz de Melo, J. M. Almeida, 

M. Musolesi, and A. A. Loureiro. You are what you 
eat (and drink): Identifying cultural boundaries by 
analyzing food and drink habits in foursquare. In 
Proceedings of the 8th International AAAI Conference 
on Weblogs and Social Media, 2014. 



[35] J. J. Sosik and D. I. Jung. Work-group characteristics 
and performance in collectivistic and individualistic 
cultures. The Journal of social psvcholoqy, 

142(l):5-23, 2002. 

[36] I. Szpektor, Y. Maarek, and D. Pelleg. When 
relevance is not enough: Promoting diversity and 
freshness in personalized question recommendation. In 
Proceedings of the 22Nd International Conference on 
World Wide Web, pages 1249-1260, 2013. 

[37] H. C. Triandis, P. Carnevale, M. Gelfand, C. Robert, 
A. Wasti, T. Probst, E. Kashima, T. Dragonas, 

D. Chan, X. Chen, et al. Culture, personality and 
deception: A multilevel approach. International 
Journal of Cross-Cultural Management, 1:73-90, 2001. 

[38] J. Ugander, B. Karrer, L. Backstrom, and C. Marlow. 
The anatomy of the facebook social graph. arXiv 
preprint arXiv:1111.4503, 2011. 

[39] I. Weber, A. Ukkonen, and A. Cionis. Answers, not 
links: Extracting tips from yahoo! answers to address 
how-to web queries. In Proceedings of the 5th ACM 
International Conference on Web Search and Data 
Mining, pages 613-622, 2012. 



